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Abstract 

We give a general framework implementing the Second Moment Method on k-SAT and discuss 
the conditions making the Second Moment Method work in this framework. As applications, we 
make the Second Moment Method work on boolean solutions and implicants. We extend this to 
the distributional model of k-SAT. 



1 Introduction to the Second Moment Method 

Just as the First Moment Method is a way to upper-bound the threshold of k-SAT, so is the Second 
Moment Method a way to lower-bound it. After a brief technical introduction to the Second Moment 
Method in section |1.1[ we present in section |1.2| a survey of the early attempts to lower-bound the 
threshold of 3-SAT through the Second Moment Method. As in the First Moment Method, the general 
idea is to count special kinds of solutions. However, the selection of solutions is not the same as in the 
First Moment Method: 

1. in the First Moment Method, we considered random variables X such that satisfiability implies 
X > 1; setting X to be the number of solutions yields an upper bound of 5.191 for the threshold 
of 3-SAT; and we tried to select the least solutions; 

2. in the Second Moment Method, we shall consider random variables X such that unsatisfiability 
implies X = (see section |1.1[ ) ; setting X to be the number of solutions yields a lower bound 



of for the threshold of 3-SAT (see section 1.2.11; here the criterion to select solutions is quite 



different: we are interested in subsets of solutions having low variance. 



In a breakthrough paper, Achlioptas & Peres - 2004 [2] succeeded with the Second Moment Method 
on k-SAT, establishing a lower bound of 2.68 for 3-SAT and an asymptotically tight lower bound of 
2 k In 2 — O (k) for k-SAT. It turns out that the currently best lower bound of the 3-SAT threshold (3.52) 
was obtained by another way: analyzing algorithms finding solutions with high probability, cf. Kaporis, 
Kirousis & Lalas [5] and Hajiaghayi & Sorkin [B]. 

The purpose of the next chapters is to make the most of the Second Moment Method on k-SAT. 
To do so we take a different approach from Achlioptas & Peres'. In our framework we select solutions 
according to the fraction of variables assigned 1 and the fractions of the different types of clauses 
(i.e. the number of true literals occurrences in the clauses). This framework is general enough to 
include boolean solutions, implicants etc. However, tuning our parameters the best way we could, we 
got numerical evidence that we could not obtain better lower bounds than 2.83. 

The stumbling block we recurrently encountered is what we call the independence point. It cor- 
responds to the couples of independent solutions in the subset of selected solutions. Even though 
solutions are independent, the proportion of literals occurrences having a certain truth value may not 



be independent between solutions. We got numerical evidence that the Second Moment Method does 
not work if truth values of literals occurrences are not independent. On the other hand, when they 
are independent, we give a necessary condition for the Second Moment Method to work, taking into 
account just the exponential equivalent of the second moment at this point. This condition tells us 
that we must select solutions having equal true and false surfaces (the surface is just the total number 
of literals occurrences), which is very artificial with respect to what we can observe with SATLab [7]. 
Using this condition, we could make the Second Moment Method work numerically; however, since 
the lower bounds we get (2.83) are far below the currently best lower bound (3.52), we do not give a 
rigorous (and tedious) proof of our lower bounds (to do so, the exponential equivalent would not be 
enough, and calculations would become quite involved). 

The very restrictive conditions we encountered to make the Second Moment Method work may 
be due to some weaknesses of our framework. We do not claim that the Second Moment Method is 
doomed to perpetual failure on k-SAT. We only hope to shed a small ray of light onto it. This work is 
very fresh, still in progress, and has not been published. 

In section [2] we present a general framework implementing the Second Moment Method on random 
k-SAT under various settings (distributions of signs, implicants. . . ). Section[3]is dedicated to a variation 
of the framework presented in section [2] for the needs of distributional models. 



1.1 How the Second Moment Method Is Supposed to Work 

Let us recall how the Second Moment Method is supposed to work: given an event A, we want to show 
that Pi (A) tends to 1 but we don't have access to Pi (A). Instead we use the first and the second 
moments of a non-negative random variable X such that Pr (A) > Pi (X > 0), i.e. X must be when 
A does not hold. For our problem 3-SAT, A is the event "a formula is satisfiable". The simplest choice 
for X is of course the number of solutions. 

1. The first thing to notice is that if we show that Pr (A) is lower-bounded by a positive constant, 
then it tends to 1. Why? Because Friedgut & Bourgain's theorem j5] established a sharp 
threshold for random k-SAT; 

2. In order to prove that Pr {X > 0) is bounded away from zero, we use the following classical 
identity: 

Pr(X>0) > . (1) 

To prove it, use the fact that EX — E (Xlx>$) since X > 0, and apply the Cauchy-Schwartz 
inequality to it: (EX) < EX 2 El^- >0 . So in particular note that 

(EX) 2 

w * 1 w 



Just as the first moment is fairly easy to compute, so is the second moment. Let X be the number 
of assignments having some property V (V might be "be a solution" or "be a black and red polka-dot 
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In the case of satisfiability however, we are going to see that the Second Moment Method is much 
more difficult to implement than the First Moment Method. The reason is that in general EX 2 tends 
to be exponentially greater than (EX) 2 , so equation [l] just says that Pr (X > 0) > 0, which is not very 
informative. . . Thus the challenge is to find out a set of solutions having low variance. 

1.2 Use of the Second Moment Method for Lower-Bounding the Threshold 



In this section we make a survey of different applications of the Second Moment Method to lower-bound 
the threshold of random k-SAT. The model considered here is uniform random drawing of k-SAT. 

1.2.1 Second Moment of Solutions 

Here is a perfect example of the failure of the Second Moment Method. Namely the lower bound 
obtained by the Second Moment Method of solutions is c = 0. If X is just the number of solutions, 
then EX — 2" (l — t^t) . So let us now compute the second moment. 

To do so we need an extra parameter /i, representing the proportion of variables changing values 
between two solutions. 

1. total number of couples of assignments: 

(a) choose the value of variables assigned or 1 in the first assignment: 2"; 

(b) choose the subset of variables assigned different values in both assignments: ( " J ; 

2. probability for a couple of assignments to be a couple of solutions: as noted by Achlioptas & Peres 
[2], it is easier to compute the probability that a clause breaks two given assignments, since it is 

(i^) fe . Using then the fact that Pr (A ("I B) = l-Pr(IuB) = 1 -Pr (3) -Pr (£?) +Pr (InB), 
it follows that the probability for a clause not to break any of both assignments is g (//) = 



of k-SAT 




Thus the second moment is: 




[MEN 



Let us look at the exponential equivalent of this quantity: 
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As explained by Achlioptas & Peres [5], it turns out that when pt = ~, this is precisely (EX) 2 . 
Now the function /i i— > ^i-h nas its maximum at /i = g , whereas g (/i) is strictly decreasing over 

(0, 1). Consequently, at any positive ratio c, the maximum of ^t^zjap 9 (m) C occurs at fi < ^ and EX 2 

is exponentially greater than (EX) . So tends to zero, and we only get that Pr (X > 0) > 0. . . 

1.2.2 Balancing True and False Surfaces 

Achlioptas & Moore - 2002 |T| noticed that g (fj,) is locally maximal at fJ, = \ in k-NAE-SAT because in 
this problem g (/i) = 1 — + ^ ^(1 — ji) k + /i fc ^ is symmetric in /i. We recall that in NAE-SAT, when 
an assignment is a solution, then the opposite assignment is a solution as well. Thus this problem 
contains some symmetry. Using this remark, Achlioptas & Moore were able to establish a tight lower 
bound on the k-NAE-SAT threshold. And since a NAE-SAT solution is a solution of standard SAT, 
they got the following lower bound of the k-SAT threshold: 2 fc_1 In 2 — O (1). 

Achlioptas & Peres - 2004 [2| put some weights onto the solutions of standard SAT and got a lower 
bound of 2 fe In 2 — O (k). (thus almost matching the asymptotic upper bound of 2 k In 2). The weights 
they put are of the form A tmcsurface lsAT, where the true surface is the number of occurrences of true 
literals under the solution. Assignments which are not solutions must be discarded because the Second 
Moment Method requires to count when there is no solution, as explained in section |1.1| In the 
particular case of 3-SAT they got a lower bound of 2.54 (and even 2.68 with a refinement). 

We are going to implement the Second Moment Method without any weights; so how shall we 
control the balance of true and false surfaces? Our control parameters will be (3t, the fraction of 
clauses having t true literals. Then the true surface will be (3i + 2/32 + 3/?3 and the false surface will 
be 2/?i + /?2- With this parameters we are able to make the Second Moment work. However, we are 



not able to achieve a better lower bound than 2.833, see section 2.6.1 

Moreover our approach is quite general and enables us to make the Second Moment Method work 
on implicants as well. 



A General Framework for the Second Moment Method on 
k-SAT 



Here we present a general framework for the Second Moment Method on k-SAT. Section 2.1 introduces 
all ingredients we need: values, signs, truth values, types of clauses and surfaces. Then in section |2~2"] 
we give the expression of the first moment of the solutions under these settings; the expression of the 
second moment is given in section [2~3| Bringing together the second moment and the constraints, we 
use the Lagrange multipliers method in section |2.4| 

One point in the space of the variables is very important in the Second Moment Method: this is 
what we call the independence point. It is important because it makes = 1 (see conditions in 



theorem [5] of section 2.5 1. Thus if we want the Second Moment to work, we must be careful that this 



point should be stationary. 



We apply this general framework to boolean solutions (section 2.6.11 and to implicants (section 



2.6.21 



We discuss the relevance of the Second Moment Method for lower-bounding the k-SAT threshold 



in section 2.7 where we use SATLab to confront the theoretical requirements we obtained with reality. 



2.1 Preliminaries 
2.1.1 Values 

First of all we have n variables. An assignment gives each variable a value taken from a given domain 
D: 
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• in the case of boolean satisfiability, D — {0, 1}; 

• in the case of implicants, D = {0, 1, *}. 

Given an assignment, for all a g D, we denote by S a the proportion of variables assigned value a: 

E = 1 ■ ( 3 ) 

Given two assignments Si and S2, for all (a, b) € -D 2 , we denote by /i a j, the proportion of variables 
assigned value a in Si and value b in Sa: 



<5 6 

A, 



(4) 
(5) 



So we are going to consider some of the /i a ,b's as functions of the other ones, assuming that equations 
[4] and [5] are satisfied. We shall refer to the remaining fj, a .b's as a generic variable /j,, cf. section 2.4.2 



2.1.2 Signs 

In a k-CNF formula with an clauses, we have ken occurrences of variables, each having a sign s G S. 
In the case of boolean satisfiability as well as in the case of implicants, S = {+, — }; 
For all s € S, we denote by p s the proportion of occurrences having sign s: 



ses 



(0) 



2.1.3 Truth Values 

The combination of a sign and a value yields a £ru£/i value v € V. Here is an example of a classical 



truth table with V = {T, F, *}: 





+ 









T 


1 


T 


F 









We use the following notation, for a £ D, s £ S and »eV 



1 if value a and sign s yield the truth value v ; 
otherwise . 



Of course one sign and one value yield exactly one truth value: 

^ . Xa,s,v — 1 ■ 



vev 



So we shall denote byo®s the unique v such that Xa,s.v = !• 

Given an assignment, for all v S V, we denote by 77^ the proportion of literals occurrences having 
the truth value v: 



^ ^ Xa,s,v&aPs 



a£D 

ses 



Given two assignments Si and S2, for all (v, i») £ V 2 , we denote by £ v<w the proportion of literals 
occurrences having the truth value v in Si and the truth value w in 



^ ' Xa.s.v 



Xb,s,wHa,bPs 
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2.1.4 Clauses Types 

A clause type is an element of V k , e.g. TTF or *TF. Some types of clauses will be forbidden, such as 
FFF and FF*. We denote by T the set of allowed types of clauses. T is a subset of V fc . 

• in the case of boolean solutions of 3-SAT: V = {T, F} and FFF is forbidden; 

• in the case of implicants of 3-SAT: V = {T, F, *} and the allowed types of clauses are those 
containing at least one T. 

Given an assignment, for all t G T, we denote by f3 t the proportion of clauses of type t (which is zero 
for all forbidden types of clauses): 

£a = 1 ■ ( ? ) 

ter 

Given two assignments S\ and 5*2, for all (t, u) G T 2 , we denote by j t u the proportion of clauses 
of type t in solution S\ and of type u in solution S% '■ 

E^« = Pu ; (8) 

ter 

E ^« = & • ( Q ) 

«er 

2.1.5 Surfaces 

Given an assignment, the surface occupied by a truth value v is obtained by summing all occurrences 
of v in the different types of clauses: E„ = X^ter ^* £i=i lt»=w 

Given two assignments S\ and S2, the surface occupied by a couple of truth values (v, w) is obtained 
by summing all occurrences of (v, w) in the different types of clauses: S V)ttl = u ) S 7-2 7t,u I2i=i lt*=i>A'. 

Surfaces are normalized to fc because the fit's sum up to 1: 

Fact 1. X^ev — ^ an <^ ^2<v,w)eV 2 ^v,™ = k. 
Proof. 

= EE&X>«- 

•uev uevter i=i 

= E^EE 1 ^ 

ter i=i uev 

- E&E* 

ter z=i 

= kj2?t 

teT 

= k by constraint [7] 

The same proof works for the other sum, using constraints [8] and [7J □ 

2.1.6 Symmetry of Occurrences 

We say that there is symmetry of occurrences when for all permutation a of V , fit = Ar(t) (it follows 
that T is closed by permutation) . 



G 



Fact 2. Symmetry of occurrences implies that Y< v = k X)teT ^-ti=vflt- 
Proof. Let us call o~i the permutation of V k swapping t\ and i,-. 



*er i=i 

i=l *GT 
fe 

^ E 1 o-i(*) 1 =o^o'i(t) b y symmetry of occurrences 



»=i teT 
fc 



l CT .( t ) i=t ,/3 ..( t ) because T is closed by permutation 

*=i ui(t)eT 



teT 



Fact 3. 7/7t )U = ft|5„, symmetry of occurrences implies that kE 
Proof. Let us call Oi the permutation of V k swapping <i and t%. 

k 

(t,«)er 2 »=i 

= k 22 PtPu ^ 1-u=v1-u t =w by independence 

(t,«)eT 2 »=i 
fc 

= &E E ftiW^titjliTiWi^^iW!^ b y symmetry of occurrences 

l=l (cr i (t),<T i (u))er 2 

= k 2 /3 t /3 tl l tl=l) l Ul=lu because T is closed by permutation 

(t,«)GT 2 

= S^E^ by fact [2} 



□ 



□ 



Symmetry of occurrences is quite natural and will be assumed from now on. Note that 5 a 's and 
jSt's are parameters of the first moment, so they may be chosen without any restriction, except that 
they must sum up to 1. They are our control parameters: we can tune them as we wish in order to 
take into account only some solutions. However, when the set of solutions defined by 5 a 's and /3 t 's 
is determined, we have to consider all possible couples of solutions. So the variables fi a ,b's and 7t,«'s 
of the second moment may not be chosen, but result from a maximization process, as investigated in 
section 12.41 



2.2 Expression of the First Moment 

The first moment of the number X of solutions can be split up into the following factors: total number 
of assignments and probability for an assignment to be a solution. 

1. total number of assignments: choose subsets of variables assigned a € D: ( ^ n " ). 
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2. probability for an assignment to be a solution: 

(a) we give each clause an allowed type t G T: ( , R cr \ )■ 

(b) probability for clauses to be constructed (variables + signs) according to their types: 
Titer (ULiVu) 

We denote by V the set of all families of non-negative numbers ((S a ) al - D , (/?t) tg T-) satisfying constraints 
[3] and [7] We denote by X (n) the intersection of V with the multiples of A ; we get the following 
expression of the first moment: 

EX = J2 T i( n ) 

((5 a ) aeD ,(A) ter )ei(n) 

where 



f k \ cn 

71 \ I cn Wtttttt It =^/3t \ 



n \ / cn 

)ter--v \ veV 



?i \ / cn 

(^ n )oe-D---/ V • • (Ptcn) ter . 



u>ev / 

The exponential equivalent of Ti (n) is T™, with the following two equivalent forms: 



%eD" a \teT 

lev ''u 



1 / Iluev 



2.3 Expression of the Second Moment 

The second moment of X can be split up into the following factors: total number of couples of 
assignments and probability for a couple of assignments to be a couple of solutions. 

1. total number of couples of assignments: ( , ); 

2. probability for a couple of assignments to be a couple of solutions: 

(a) we give each clause an allowed type t G T in solution S± and another u G T in solution S2 : 

I cn \. 

V...(7 t ,„cra) (t)<l)er2 ..J' 

(b) probability for clauses to be constructed (variables + signs) according to their types: 
U(t,u)eT 2 (JIi=i £ *«,«i y 



<s 



We denote by V2 the set of all families of non-negative numbers ((fia,b)( a b)^o 2 > (7*.«)(t «)eT 2 ) sa *i s " 
fying constraints [4J [5] [5] and [9I We denote by T% (n) the intersection of V2 with the multiples of ^ ; we 
get the following expression of the second moment: 

EA 2 = Yl T ^ n ) 

((/ i o,6)( a ,i,) S D2,(7t,-u) (t ,„) 6T 2)e2;2(n) 

where 



To(n) 



n \ 1 cn 

(^.bn) {a b)eD2 . . J V ■ • (lt,ucn) (t u)er2 

n \ ( cn 

(^■b n )(a,b)eDi ■••/V- (Tt,«cn) (tu)GT ^ 



11 (11 -V 

,(t,«)er 2 \j=i / 



n n n * 

,((,n)6T 2 «=i (D,iu)ev 



cn 



(Pa,bn) (a b)eD2 ...J\... (7t,«cn) (t u)GT2 . . 7 1 ( ^ 



(Pa,bn) (a b)(iD 2 ■ ■ ■) V- ■ • (7t,ucn)( t ,„) e r 2 ' ' ■) l^J^v 2 
The exponential equivalent of T2 {n) is T^ 1 with the following two equivalent forms: 



1 ( TT flli=i £ ti,ui\ 



n 



ll(a,b)eD 2 V a ,b \( t ,u)eT 2 \ 7t '" / 

n(u,,«)Gv 2 £ «.« \ 

ri(t : «)er 2 7t,u / 

2.4 Expression of the Lagrangian 

When the parameters of the first moment (i.e. (S a ) aeD , (/3t) tg - 7 -) are chosen, T 2 must be maximized 
under constraints [4] [5] [8] and [9] That leads us to use the Lagrange multipliers method. 

As explained in section [2. 1 . 1 1 we are going to consider some of the fj, a ,b's as functions of the other 
ones, assuming that equations |4| and [5] are satisfied. We shall refer to the remaining /i a ,b's as a generic 
variable /i. So we define the following Lagrangian: 

A = - ^ ^ Qifc ln^^-c ^ 7t,«hi— + c X! ^v,w^£v,w 

{a,b)£D 2 ' {t,u)eT 2 ' (v,w)eV 2 

+c]T (In / t ) (^ 7t) „-ft) +c ^(ln ffu ) (^ 7t ,«-AJ • 
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2.4.1 Derivative with respect to j tj 



<9A 

= -cln^ tu + c}lnet i ,u i +chi.ft + c]iig. u 



Canceling out this derivative yields: 



It,- 



2.4.2 Derivative with respect to /i 



= ft9uY[s tuUi . (10) 



i=i 



9A \ - d A dfi a . b 

{a.b)£D z 



— — ln// a , 6 + C ^ 2^ £ 2^Xa,s,vXb, S ,wp S 

(a,b)eD 2 ^ (a,b)£D 2 ^ (v,w)eV 2 V ' W sES 

E9^a,b , 9fl a h\~^ ^a®s,b®s 

— -In^ + c ^ ^ Ps ^^ . (ii) 



(a,fc)GD^ ^ (a,b)£D 2 °^ SS S £ 



Canceling out this derivative is somewhat tricky in general, so we are going to focus on some simple 
particular cases and otherwise end up calculations numerically with Mathematica. . . 
So let us consider first a particularly simplifying case, i.e. when e v w = T) v r) w . 



More 



2.5 Independence Point - Discussion about e V)V] 

We define the independence point in the polytope 7^2 to be the point where fi a ,b — S a 5 b and 74 

This point is of major interest because it make ^ = 1 if e v w = r\ v r\ w (see theorem 

1 

surprisingly, it turns out that there seems to be a dichotomy in the success / failure of the Second 
Moment Method, regarding e v<w at the independence point: 

• if s v w = r) v r) w then we are able to find a necessary and sufficient condition on the first moment 
parameters for the Second Moment Method to give a non trivial lower bound in all models we 
considered: 



boolean solutions, see section 2.6.1 



— implicants, see section [2. 6. 2| 

— distributional model, see chapter [3] 

• otherwise numerical calculations give us empirical evidence that the Second Moment Method 
fails to give any non trivial lower bound to the threshold; indeed even when we have almost this 
identity, the ratio |§ is strictly greater than 1 for any positive ratio c (see figures at the 

end of section 2.6.1 and figure 12 in chapter [3]). 



Conjecture 4. The Second Moment Method works only if e VfW = r\ v r\ u 
This conjecture echoes the following theorem. 
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Theorem 5. At the independence point (i.e. fi a b = $a&b an d Jtu = fitfiu), tf £v w — VvVw; then 
Ik - i 

1 1 

Proof. Let us recall that 



^1 ri(a : 6)G_D 2 M a ,& 

So at the independence point: 




(XheD sS a a ) (il 

T i 2 = n 



(a,b)S-D 2 



(<Ub) 



<5„<5h 



(rw/^) (iw 0£O n (t ,„ )e r= ((nti (niu ^ 



V 



n 



(t,«)GT 2 



(A At) 



A/3„ 



liter 11^1%, 



To show that this quantity is indeed 1, we shall use the following fact: 
Fact 6. Let X and y be two finite sets. We assume that ^2 x€X e x — Ylyey fy = 1- 
Then U, !r . x y (b x c y ) e ^ = (ft,, x //, ) (iL, ' ^ 
Proof. 



n ( b * 

(x,y)£Xxy 



n ^ ii ^ 



xex y ey 



n 



□ 



Now to prove that the previous ratio ^ is 1. it suffices to apply fact |6j 3 times: 
with X = y = D, b x = e x = S a and c y = f y = 5b, which is possible thanks to equation [3| 
with X = y = T, b x = e x — f3 t and c y = f y = /3 U , which is possible thanks to equationl7j 
with X = y = T, b x = Y\i =1 r] ti , e x = f3 t , c v = rii=i ? 7«« and fy = Au which is possible again 
thanks to equation [7] □ 

Moreover, it turns out that the independence point satisfies constraints [4] [5j [8] and [9] thus T2 must 
be stationary at the independence point if we want the Second Moment Method to work (because |§ 



must not exceed 1 if we want to avoid the pitfall we encountered in section 1.2.1). Thus we have the 
following necessary condition to make the Second Moment Method work: 



<9A 

— — = at the independence point. (12) 



Remark 7. We show rigorously only the fact that the above conditions (e v<w = r\ v r\ w and equation 12 1 
are necessary to make the Second Moment Method work, but not that they are sufficient. This would 
require to handle the polynomial residues of the multinomials, and the complete expressions of EX 2 
and (EX) 2 . Since we show only negative results (i.e. bad lower bounds), this tricky part is omitted. 
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2.6 Applications 
2.6.1 Boolean Solutions 
Preliminaries. 

1. the domain of values is D — {0, 1}; given a solution, we call 5 the fraction of variables assigned 
1; constraints [4] and [5] become: 



so, if we define p = jiio i, we have 



S 
6 

1-8 
1-8 



A*o,o 
Mi,o 
Mo,i 



+ Mi,o 
A*o,o + A*o,i 



5- 
1 - 

/'• 



•<J- 



2. the set of signs is S* = {+,—}; we call p the fraction of positive occurrences 

3. the set of truth values is V = {T, F} and the truth table is 





+ 







F 


T 


1 


T 


F 



so 



£t,F 



: £f,T 
St,T 
£f,F 



^ + (1 - p) (1 - 5) 
(l_p)* + p(l-*) 



r) T - fi 
Vf - H 



(13) 
(14) 
(15) 



4. the set of allowed types of clauses is T = D k \ {F k }. 



Condition for e ViW = r\ v r\ w at the Independence Point. The first thing to notice is that if 
£t,f = VtVf, then the three other identities follow, because st,f + £t,t = Vt etc. 
At the independence point, we have fj, = 8 (1 — S). Thus 

e T ,F-VTVF = S(l-5)-(pS+(l-p)(l-8))((l-p)S + p(l-5)) 



5(l-6)-p(l-p) (8 2 + (1 - S) 2 ) [p 2 + (] 
2p(l-p)S(l-5)-p(l-p) (<5 2 + (1 - 5) 2 ) 
-p(l-p)(25-l) 2 . 



pf) 6(1-6) 



Consequently, £t.f < VtVf, with equality iff p G {0, 1} or 6 = ^. 

We discard the particular case of p £ {0, 1} (which corresponds to monotone k-SAT, always trivially 

satisfiable). It turns out that as soon as 8 ^ i, we could not make the Second Moment Method work: 

our numerical attempts revealed that the ratio S is strictly greater than 1 for any positive ratio c. 

1 1 

On the other hand, when 5 — s we could make the Second Moment Method work, as follows. 
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Condition for the Second Moment Method to Work at S = |. As mentioned in section 
stationarity of the independence point implies that ^jj = 0. 
Using equation [TT] 



2.5 



OA 



E 

(a,b)£_D 2 



In : — h c 



= In 



Mi,iMo,o 
Mi, 0/^0,1 



£T,F 



£F,T 



^T.F 

£T,F 
_ 2t,T 
St,T 



^F,T 
£F,T 



1 — -T.T 



^F,F 



Et,T £f,f 



At independence 



10,1 <5i^o<5o«5i 

independence of surfaces (cf . fact |3| : 

<9^ 



Canceling out this derivative yields: 



Since we assume 6 = g , we have ?7t 



= 1; moreover, assuming symmetry of occurrences, we may use 



_3l 

Vf 
Vt 



^Jrp 

Vt 



' VFVT 



Vf 



St* 
Vt 

Vf = 



Vf 



thus E T = £ F = §. 



It turns out that this condition is sufficient to make the Second Moment Method work, and nu- 
merically we found a critical ratio c = 2.833 for Ptff — Pftf — Pfft — 0.197633, Pttf = Ptft — 
Pftt = 0.104733 and Pttt = 0.0929. It is noticeable that this critical ratio is the same for any value 
of p: this comes from the fact that laying down t]t 



If 



equations 



equation 23 has no dependence in p. 

The First Moment Method applied with these settings (i.e. 



13 



14 



and 



15 



imply that 



\) yields a critical ratio of 3.783 



when Ptff = 0.191, which means that such balanced solutions disappear far below the conjectured 
threshold ratio of 4.25. Thus we would like to evade the 8 = h condition. 

Moreover SATLab enables us to see that real solutions do not have 5 — h, see our discussion in 
section 12.7.11 

Attempts to evade the S = | condition. We plot lnln Ji| for different values of S and p, at a 
point satisfying our constraints JsJ and |9j The expected value is — oo iff J§ = 1. We set the ratio c = 0.1 
(to be compared with 2.833, where 5 = J works). 

1. setting Yjt = £f = f : only 5 = \ seems to make Ji| = 1, cf. figure jlj 

2. setting £ F = krjx and £ F = krjp: once more, only S = | seems to make J§ = 1, cf. figure |2j 
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2.6.2 Implicants 

An implicant is a partial assignment such that every assignment of the non-assigned variables will yield 
a solution. We represent the non-assigned value of variables by a *. We performed the calculations on 
implicants with the hope that their variance might be lower than the solutions'. 

Preliminaries. 

1. the domain of values is D = {0, 1, *}; given a solution, we call S the fraction of variables assigned 
1 and a the fraction of variables assigned *; constraints [4] and [5] become: 





6 




Mm 


+ Mi.o - 


- Mi,* 




6 




Mm 


+ Mo,i - 


- M*,i 




a 




M*,i 


+ M*,o - 


/^* 5 * 




a 




Mi,* 


+ Mo,* - 


^i* ^ * 


1-6- 


- a 




Mo,o 


+ Mo,i - 


- Mo,* 


1-8- 


- a 




Mo,o 


+ Mi.o - 


-M*,o = 


= MM) 7T 


— Mi,* 


and 7r 


= M*,i) 


we have 


Mi,o = 


5- 


v - 


- 7T 






Mo,i = 


6- 


v - 


- 7T 






M*,o = 


a — 


M 


— 7T 






Mo.* = 


a — 


M 


— 7T 






Mo,o = 


1 - 


26 


- 2a 4 


- M + v 


+ 7T + 7T 



2. the set of signs is S = {+, — }; we call p the fraction of positive occurrences; 

3. the set of truth values is V = {T, F, *} and the truth table is: 





+ 







F 


T 


1 


T 


F 




* 


* 



and 





— pv + 


(1- 


£t.F 


= m(*- 


- v — 


£f,T 




— v — 


£f,f 


- K 1 


-28 




= M 




£t,* 


= /97T + 


(1- 


£*,T 


= piT - 




£_F,* 


= p(a 


- M - 


£*,F 


= P\a 


- M - 



t?t = /0<5 + (1 — p) (1 — 5 — a) 
7y F = (l-p)6 + p(l-6-a) 
V* = a 



l-25-2a + p, + i' + n 



(1-m)t ; 
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4. a clause type is allowed iff it contains at least one T. 



Condition for e V}W — r\ v r\ w at the Independence Point. The independence point is defined by 
/i = a 2 , v = 8 2 and 7r — it = aS. So at this point we have: 

£t,t = pS 2 + (1 — p) (1 — 5 — a) 2 
Et,f = £f,t = 8 (1 — 8 — a) 

Sf.f = p(l-5-aj 2 + (1- p)8 2 

2 2 

£*,* = ol = V* 
e T ,* = £* : t = ot (pS + (1 — p) (1 — 8 — a)) — r\Tf}* 
£f,*=£*.f = a{p(l — 5 — a) + (1 — p)8) =T}fV* ■ 

The first thing to notice is that all identities involving * satisfy e v>w = rj v r] w . The second thing to 
notice is that if St,f — VtVf, then the three remaining identities follow, because £t,f j t£t.t+£t.* — Vt 
etc. 

e t ,f-VtVf = 8(1 - 8- a) - (pS + (1 - p) (1 - 8- a)) ((1 -p)S + p(l - 8- a)) 

= 8 (1 - 5 - a) - p (1 - p) (s 2 + (1 - 8 - a) 2 ) - [p 2 + (1 - p) 2 ) 8 (1 - 8 - a) 

= 2p (1 - p) 5 (1 - 5 - a) - p (1 - p) (8 2 + (1 - 8 - a) 2 ) 

= -p(l-p)(28 + a-l) 2 . 

Consequently, £t,f < VtVf, with equality iff p £ {0, 1} or 8 = ^f L - 

We discard the particular case of p £ {0, 1} (which corresponds to monotone k-SAT, always trivially 
satisfiable) . It turns out that as soon as 8 ^ we could not make the Second Moment Method 

work. 

On the other hand, when 8 = we could make the Second Moment Method work. 

Remark 8. Setting a — corresponds in fact to solutions (which are some trivial implicants), thus 
to some extent we get back to the 8 = | condition. But is there a positive a yielding a better lower 
bound than a = 0? 

Condition for the Second Moment Method to Work at 8 — ^p. As mentioned in section 
stationarity of the independence point implies that = §^ = = J^r = 0. 
Using equation |TT) i.e. 



2.5 



<9A \ - dp atb \ - dpa.b V' 

(a,b)£D 2 (a,b)£D 2 sGo 
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we get: 

dA M*,oMo 
In 



dp M* : *Mo,o 
+cf(l-p) 



"T,T , ■=■*,* "T,* ^*,T \ , / "F,F , ■=*,* "F* =*,F 
+ ■ +p[ — + 

E T ,T V*,* £ T,* £*,T J \ £ f,f V*,* £ F,* £*,F 

"Bt.T , ^F.F ^T.F ^F,T 



dA M1.0M0.1 

-7T- = In — + c , 

Mi,iMo,o \£t,t £f,f £t,f £f,t 

dA Mi.oMo,: 
In 



37T Pl,*P0,0 



dA Po,iP*,o 

7w = ln 

07r M*,iA*o,o 

+cf 



"T,T , ^F,* =F,T =T,* \ , / '— 'F.F . "7> =T,F =F,* 

+ : +/3 + 

£t,T £f,* £f,T £t,* 1 \ £f,F £t,* £t.F £f,* 



"T,T , "*,F 'T,F "*,T \ , / 'F,F , "*,T 'F,T "*,F 



£t,T £* : F £t,F £*,T / V £ FF £*,T £ft £*,F 

We consider the independence point, thus p a ^ — 8 a 8b- Assuming symmetry of occurrences, we may 
use independence of surfaces (cf. fact|3|; moreover, using condition e V}W = rj v rj w , we get that: 

dA 
dp 

dA 
dv 

dA OA ( s /E T E F \ /E T E,\ /E F E T \ /E F E„ 




dn dir' \ \Vt Vf J \Vt V* J \Vf Vt J \ Vf V* 

Canceling out these derivatives yields: 



E* E F E F 

V* Vt Vf 

Since we assume 5 ~ we h&ve Vt — Vf — moreover r/* = a. Thus E F = E F = k^^ and 
E* = ka. 

It turns out that this condition is sufficient to make the Second Moment Method work, and nu- 
merically we found the critical ratios laid in table [l] for standard 3-SAT with symmetry of occurrences 
at p = \. 

These values are to be compared with those of Boufkhad & Dubois - 1999 [3 , who proved for 
example that at the ratio c = 4.5, any satisfiable instance will have prime implicants with a = 0.01. 
Combined with the lower bound of 3.52 of [8] and [B], this proves that such implicants exist almost 
surely when c < 3.52. 

Thus in the range c £ (2.81,3.52) Boufkhad & Dubois prove that implicants with a — 0.01 exist. 
However, in the range c < 2.81 the Second Moment Method enable us to establish the existence of 
implicants with a a significantly greater than Boufkhad & Dubois's. 

How can we interpret the fact that the critical c obtained decreases with a? Looking at the set of 
allowed types of clauses: 

T = {TTT, TTF, TFT, FTT, TFF, FTF, FFT, TT*, T *T, *TT, T * *, *T*, * *T} 
1>{TF*,T * F,FT*,F *T,*TF,*FT} , 
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Table 1: Critical ratios c of implicants obtained for a given a (and the corresponding choice of the free 
P parameters). 



a 


c 


Ptff 


pTTF 


/3t** 


ratio c of [3] 


0.001 


2.81 


0.195 


0.10867 


3.33 x 10~ 5 


4.5 


0.01 


2.77 


0.1942 


0.098833 


3.33 x 10- b 


4.5 


0.05 


2.52 


0.17767 


0.08167 


0.001633 


2 


0.08 


2.32 


0.1633 


0.07 


0.00233 


1.5 


0.11 


2.13 


0.1533 


0.05467 


0.0033 


1 


0.15 


1.88 


0.13833 


0.041 


0.012 




0.2 


1.59 


0.1233 


0.02567 


0.02833 




0.25 


1.31 


0.10833 


0.0133 


0.04767 




0.333 


0.89 


0.094433 


3.33 x 10" b 


0.094167 





2.D 



_i i i i i i i i i i i i i i i i i i i i i i_ 



OJU D.LD D.L3 D.2U ft3 0.30 
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we can see that there are 27 T's, 15 F's and 15 *'s. Thus when *'s are present, the ratio T/F is 
f| = 1.8. Without *'s, T/F would be ^ ~ 1.33 (see section Kf3 . Now, since the Second Moment 
Method requires Et = Sp, we can see that it is all the more artificial as T/F is large. Thus adding 
*'s should cut down the Second Moment Method's performance. 



2.7 Confrontation of the Second Moment Method with Reality 

Using SATLab, we investigate the behavior of real solutions and we emphasize how it differs from the 
conditions required by the Second Moment Method that we laid down just above. 

Based on numerical calculations of figures [T] and [2] we conjectured in section |2.5| that the Second 
Moment Method might work only if e VjW = r\ v r\ w . In this setting we showed that the independence 
point defined by fi a b — 6 a Sb and jt,u — fitfiu must be a maximum of T2, the second moment. This 
led us in section [2.6.1| to the following necessary condition for the Second Moment Method to work on 
boolean solutions: 6 = h. 

Now using SATLab, we are going to give experimental evidence that: 

• real solutions of standard 3-SAT violate all of these conditions: they are not independent at all! 

• real solutions of standard 3-NAE-SAT seem to be rather independent. 

These observations may explain why the Second Moment Method performs so poorly on standard 



3-SAT (cf. section 2.6.11 whereas it works pretty well on 3-NAE-SAT (cf. Achlioptas & Moore - 2002 



2.7.1 Distances between Solutions 

It turns out that in random 3-SAT, solutions are correlated with respect to their Hamming distances. 
Namely their Hamming distances are not centered around 50% contrary to solutions of random 3-NAE- 
SAT, but narrower to each other (cf. figure [3j. 

What we mean by Hamming similarity between two assignments is just the proportion of variables 
assigned the same value in both assignments. We took all couples of different solutions in a sample of 
random solutions output by a solver, and we plotted the frequency of the Hamming similarity. 

To have a more precise insight into Hamming similarity, we separated fixed and free variables. Let 
us recall that a variable is free iff flipping it yields another solution. We can notice that Hamming 
similarity is significantly greater among fixed variables than among free variables (see figure [4]) , and 
that it increases with c for both types of variables (see figures [5] and [6]) . 



19 



Observable=Distances 
Assignments=Solutions 
Range=AII_v ars 
N=3000 

C=2 
Seed=l 
Solwer=WalkSAT 
K-for-Standard_SAT=3 
K-for-Standard NAE=3 



1800 
1600 
1400 
1200 
1000 
800 
600 
+00 
200 




Powered bySATLab. 



Model = 
Model : 



Stand ard_NAE C 
Standard SAT C 




20 40 60 

Hamming similarity (%) 



30 



100 
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Figure 4: Hamming similarity is greater among fixed variables. 
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Figure 6: Hamming similarity among fixed variables increases with c. 
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Figure 7: The true surface of fixed variables decreases with c. 



2.7.2 Surface of True Literals 



What we call true surface is the scaled number of true occurrences of literals. We can see a fundamental 
difference between the true surface of fixed variables and the true surface of free variables. Namely 
the true surface of fixed variables decreases with c (figure [7]) whereas the true surface of free variables 
increases with c (figure pi) . Note that both quantities converge to roughly 0.56 (i.e. roughly |) when c 



approaches the threshold ratio, whereas in section 2.6.1 we got the following condition: = Y,p = | 
to make the Second Moment Method work. 

We interpret the ratio | as follows: the allowed types of clauses are 

{TXT, TTF, TFT, FTT, TFF, FTF, FFT} 

which amounts to 12 T's and 9 F's. Now = ±| = |. 



2.7.3 Non-Independence of True / False Surfaces 

Let us consider two solutions Si and S2 ■ We denote by Hp the false surface under solution Si , £t the 
true surface under solution S2 , and 'E.ft the surface which is false under Si and true under $2. In a 
given sample of random solutions, we took all couples of different solutions (Si, S2) and computed the 
ratio kg T ', the histogram in figure 9 plots the frequency of this ratio for the solutions of two different 
models of formulas: random S-NAE^SAT and random 3-SAT. Although some independence seems to 
exist in 3-NAE-SAT (i.e. the ratio is centered around 1), it can be seen that there is no independence 
of these surfaces for random 3-SAT. 

To have a more precise insight into the non-independence, we separated fixed and free variables. 
Let us recall that a variable is free iff flipping it yields another solution. We can notice that non- 
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Figure 8: The true surface of free variables increases with c. 
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Figure 9: (Non-)independence of surfaces in NAE-SAT and in SAT. 
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Figure 10: The non-independence of surfaces comes rather from fixed variables. 



independence comes from both free and fixed variables, but rather from fixed variables than from free 
variables, cf. figure [T0| 

2.7.4 Non-Independence of Clauses Types 

Here what we call clause type is the number of true occurrences of variables in the clause. Let us 
consider two solutions Si and S^. We denote by f3i the proportion of uniquely satisfied clauses under 
solution Si, b\ the proportion of uniquely satisfied clauses under solution S2, and 71.1 the proportion 
of clauses which are uniquely satisfied under S\ and under S^. In a given sample of random solutions, 
we took all couples of different solutions (Si, S2) and computed the ratio the histogram in figure 
[Tl] plots the frequency of this ratio for the solutions of two different kinds of assignments: solutions 
of a random 3-NAE-SAT formula and solutions of a random 3-SAT formula. Although independence 
seems to happen among solutions of random 3-NAE-SAT (i.e. the ratio is centered around 1), it can 
be seen that there is no independence in random 3-SAT. 

In real instances we can assume symmetry of occurrences, in the sense of section |2.1.6| so in the 
light of fact [3] we could conclude from the non-independence of surfaces observed in section |2.7.3| that 
in the real solutions of 3-SAT independence of clauses types would not hold. 

3 Second Moment Method on Distributional Random k-SAT 

Using the standard distributional model instead of the standard drawing model yields better upper 
bounds on the satisfiability threshold. Moreover, we would like to gain some more control over the 
proportion of variables assigned 1 according to the imbalance between their positive and negative 
occurrences. Namely, a variable is all the more expected to be assigned 1 in a solution as it has more 
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Figure 11: (Non-)independence of uniquely satisfied clauses in NAE-SAT and in SAT. 

positive occurrences, and vice- versa. At least this seems to happen on real solutions, see figure [l3| in 
section 13.6.11 

That is the reasons why we are going to implement the Second Moment Method in the distributional 
model. 

In this section we follow roughly the same outline than in section [2j but we focus on solutions only; 
we only emphasize the major differences with respect to the general framework of section [2] 

3.1 Preliminaries 

3.1.1 Occurrences and Signs 

We still have n variables. We denote by d Piq the fraction of variables having p positive and q negative 
occurrences. 



^2 {p + l)dp,q = kc . 
Occurrences and signs of variables are determined a priori. 

We ought to consider light and heavy variables C and T-L (cf. [4]). In fact we are going not to worry 
about that, because they make the calculation heavier, and in the end we shall see that there is no 
need to be rigorous since we only have negative results. 
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3.1.2 Values 

Given a boolean assignment, we denote by 8 p . q the proportion of variables with p positive and q 
negative occurrences which are assigned 1. Thus the proportion of variables with p positive and q 
negative occurrences which are assigned is 1 — 6 p . q . 

Given two assignments Si and S2, for all (a, b) £ D 2 , considering variables with p positive and q 
negative occurrences, we denote by: 

• X Pi q the proportion of variables which are assigned 1 in Si and S2] 

• the proportion of variables which are assigned 1 in Si and in 6*2; 

• \x p q the proportion of variables which are assigned in Si and 1 in S2', 

• Vp.q the proportion of variables which are assigned in Si and 52- 
We have the following constraints: 

3p,q — ^p,q 

3p,q — ^p,q + ^p,q 

1 ^p,q — f-^p,q ^p-q 



1 - 5p,q 



thus 



^p,q ^p>q 
^p,q = ^p,q 



fp,q f ^P-,q f^P><i 

which enables us to work only with // Pl9 . 



3.1.3 Truth Values 



Y c (P 5 P,q + <1 i 1 - 5 P,q)) d P,q 

(p.,q)eN 2 

71 F = Yc Y (<l S p,q + P i 1 - S p,q)) d p,q 

£ T,T = J- ^2 (P( S P>Q ~ Vp,q) + Q (! - S p,q ~ Vp,q)) d p,q 
(p.,q)eN 2 

£ F,F = J- ^2 (q (Sp,q - Pp^q) + P (1 - Sp,q ~ Pp,q)) d p , q 
£T,F = £F,T = ~j~ ^ (P + q) Vp,qdp,q 



(p,q)<£N 2 



3.1.4 Types of Clauses and Surfaces 



We keep the same definitions as in section |2.1| 

However, some extra constraints on surfaces occur in the distributional model, because here all 
occurrences and signs of variables are determined a priori: 

£„ = kr] v ; (16) 

1 — 'v,w ^£y,w ■ (l'' 7 ) 
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3.2 Expression of the First Moment 

The first moment of the number X of solutions can be split up into the following factors: total number 
of assignments and probability for an assignment to be a solution. 

1. total number of assignments: choose subsets of variables assigned or 1: 
tt I dp, q n \. 

iJ -(P>8)££ \8 Pjq d p , q nJ ' 

2. probability for an assignment to be a solution: 

(a) number of satisfied formulas: 

i. we give each clause an allowed type t £ T: ( IR cr \ ) 

ii. we find a permutation of the true literals into the true boxes and a permutation of the 
false literals into the false boxes: (r)Tkcn)\ {r}pkcn)\ 

(b) total number of formulas, i.e. number of permutations of the occurrences of literals into the 
boxes: (kcn)l 

We denote by V the set of all families of non-negative numbers ((Sp^)^ 9 ) E £ , (Pt)teT^) satisfying 

constraint [7] We denote by T(n) the intersection of V with the multiples of ~; we get the following 
expression of the first moment: 

EX = Yl T i( n ) 



where 



_ (p,q)eC \S Vtq d p<q n) \...(l3 t cn) teT ..J 

^ ' ( ken 



( ken \ 
yrj^ken) 

The exponential equivalent of T± (n) is T™, where 
3.3 Expression of the Second Moment 

The second moment of X can be split up into the following factors: total number of assignments and 
probability for an assignment to be a solution. 

1. total number of assignments: choose subsets of variables assigned or 1: 

tt / d p>q n S. 

•I MP,<?)££ \\ p , q d p-q n,fj, P:q d P]q n,fi P]q d p , q n,^ Piq d P:q nJ ' 

2. probability for an assignment to be a solution: 

(a) number of satisfied formulas: 

i. we give each clause two allowed types: { , ^ ) 

ii. we find a permutation of the literals into the corresponding boxes: 
(£T,Tkcn)\ (eT,Fkcn)\ (e^j-fcen)! (eF,Fkcn)\ 
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(b) total number of formulas, i.e. number of permutations of the occurrences of literals into the 
boxes: (kcn)\ 

We denote by "P 2 the set of all families of non-negative numbers U/Up,g)( p 9 ) S £ > (lt,u)(t «)gr 2 ) satisfying 

constraints [8] and [9] We denote by I2 {n) the intersection of V2 with the multiples of i ; we get the 
following expression of the second moment: 



EX 2 = Yl T ^ n ) 

((Mp,,) (p ,, )ec ,(7t, tl ) (tiU)er 2)eX2(n) 



where 



rr I d P,i n )( cn \ 

'■>-{p,q)£C \ A p-9 d p ^ q n, fj, p>q d p }q n,u pq d Ptq n,v Piq d p , q nl v..(7t, u cn), t u)er 2 ■•»' 

The exponential equivalent of T2 (n) is T^, where 



T 2( n ) - , ken 

\€t Tkcn,€T Fkcn,€F T^cn,€F F^cnJ 



To 



V 



si 

^p,ql 



£t t £t.f £f t £f f 



3.4 Expression of the Lagrangian 

When the parameters of the first moment (i.e. {5 p , q )^ p , {Pt) te q-) are chosen, T 2 must be maximized 
under constraints [8] and [9] That leads us to use the Lagrange multipliers method. In order to make the 
forthcoming maximization easier, we introduce some extra variables ip VtW which are going to simulate 
e VyW . The reason for this is that e VtW contains H Pt q, but we need the expression of /i Pj9 for our numerical 
calculations. So, because of equation [IT] we have the following constraints: 



kST,T 


= klpT,T 


= ^T,T 


ker.F 


= klpT,F 


= ^T,F 


kEF,T 


= klpF,T 


= ^F,T 


ks F,F 


= k?pF,F 


= ^F,F 



Using the facts that £t,t + £t,f — Vt, ^t,t + ^t,f = and St = krjr, we see that constraint 
£t,t = ipT,T is redundant. Eliminating Ef,t and £f,f as well, there remain the following 5 constraints: 



£t,f 

i>T,T 

4>T,F 

1pF,T 
1pF,F 



IpT.F 

VT - i>T,F 

k 

4>T,F 

TjF - 4>T,F 



(18) 
(19) 

(20) 

(21) 
(22) 
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So we define the following Lagrangian: 



p.q ^p-q I / -i jr \ | 1 ^p,9 Mp,9 



A = - d P: g [ (<5p >g - n PA ) In p,<? - p,<? + (1 - 5p, q - (i Ptq ) In 

(p,?)e£ 



e 



-2 ^ d Piq fi Ptq In - c ^ 7 Ml ln^+fcc ^ Vd,™ In — 
(p,?)6£ (t,M)er 2 ' (u^Oel^-F} 2 

+c ^ (In ft„ )tu ) (Hy,™ — kip v>w ) + fee (In 2/) (i/)t,f — £t,f) 

(v,w)e{T,F} 2 



teT VugT / «ew VteT 



7t,« - Ai 



3.4.1 Derivative with respect to j t .u 
OA 



In 7^ + c^ln/i ti)Uj + cln/i + clng^ 



Canceling out this derivative yields: 

3.4.2 Derivative with respect to ip v 

OA 



ft9ut[h u , Ui . (23) 



= kc In ip VtW - kc\nh v ^ w + fcc(lny) 1 v =taw=f 



Canceling out these derivatives yields: 

f/fv,w = h ViW y-^'= T ^ . (24) 
Thus constraints [18] [19] [20] [21] and [22] become: 

/lT,F 

£T,F 



y 

7 ^T,F 
"'T T — ??T — 

V 

hr,F _ St,f 
2/ fc 

ftf.T = 

y 

7 ^T,F 

«-f,f = ?7f 

2/ 
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3.4.3 Derivative with respect to jjb pq 



^- = d M In V™ ~ (1 2 ~ *>« ~ - (p + g) d p , q In y . (25) 

O^p^q ^ pq 

Canceling out this derivative yields: 
i.e. 

mL( 1 -^ +9 )-Mp,, + <Wi-<W> = o . 

Thus there are 2 cases to consider: 

1. case where y — 1 or p + q = 0: /i p , g = 5 p , g (1 — Sp,q)] 

2. case where y ^ 1 and p + g ^ 0: = 1± V /l 4 ^ 1 y _^ p |^''' ( ' 1 Ap ^ i; numerically we can find 
solutions with = — * 2(1-^+9) • 



3.5 Independence Point 



As in section |2.5| we define the independence point by [i v . q = S p ^ q (1 — S Ptq ) and "ft.u = PtPu- Again, 
we were able to make the Second Moment Method work only if e v _ w — r\ v r\ w . 

When e V}W = rj v rj w , we have ^§ = 1 (see proof of theorem jHJ) and the independence point is 

stationary without any extra condition (plug h v w — r) v rj w , y = 1, f t = rrfc ,3t — , g u = r , fc ' 3 " — into the 



constraints and equations 23 24 and 25 1 , assuming symmetry of occurrences as usual (and thus fact 
[3]) . By comparison with chapter |2J we could say that the extra condition we had there on the surfaces 
to make the independence point stationary corresponds here to the preliminary extra constraint |17| 

Moreover, it is noteworthy that, because of fact [3] and constraint |16| the independence point violates 
constraint 17 when e v ^ w ^ T) v r) w . 

So, what is the condition for s v>w = rj v r] w l 

3.5.1 Condition for e v , w = r\ v r\ w at the Independence Point 



As before in section 2.6.1 the first thing to notice is that if et.f — VtVf, then the three other identities 
follow, because Et.f + £t.t = f]T etc. 

We are grateful to Emmanuel Lepage, who gave us the main idea to compare St.f and r\TT\F- Let 
us make the following change of variables: S Piq = 5 + 3 p ,q : 

1 1 



Vt 



•> ■ kc Y (p-i)i, q d P ,q ; 

(p.q)eN 2 

VF = \- Y c Y fr-l) 6 P,q d P,Q > 
( P ,q)£N 2 

£T,F = \-y c Yj + 6 P,q d P,<l ■ 

' (p,q)eN 2 



Thus 
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e T ,F - VtVf = 



(P-Q) S p,q d P,q 



(p,q)£N 2 



kc 



(P + q) 5p,qdp,q 



(p,q)£N 2 



(fee 



q)£N- 



(P - l) $p,qdp,q 



< 



± v 

fee ^-^(p, 



q)£N 2 



(p - q) 8. 



P.q 



*p,q 



with equality iff 



(p ~ q) 8 p q nas the same sign wherever d p>q 7^ 0; 



2. Since p, q > 0, \p — q\ < p + q with equality iff p — or q = 0; 

2 



fee Z^(p,q)£N 2 



(p - q) 8 P ,, 



J-p,q 



< 



kc ^2(p,q)£N' 







5 'v,i 





with equality 



iff S p q — wherever 0, g 7^ and d PtQ 7^ 0; 
3. By the Cauchy-Schwartz inequality, 



< 




kc 



Y (P + q) 8p, q d p , 



9 ' 



(p.,q)£N 2 



with equality iff 



'p.q 



has the same value wherever (p + q) d pq 7^ 0. 



To conclude, £t,f < VtVf with equality iff (<5 = whenever (p + g) t2 P)g 7^ 0) or (<L „ is symmetric 
in p, g and the model has only pure literals). 

This means that in all models allowing non-pure literals (in particular the standard model having 
a 2D-Poisson d Pjq ), Et,f = VtVf iff 8p,q — \ whenever (p + q) d p>q 7^ 0. 

Consequently, even in the distributional model, we encounter the very restrictive condition S p . q = \ 
to make the Second Moment Method work. 

Numerically, we found a critical ratio of 2.838, thus very slightly above the 2.833 obtained in the 
drawing model (cf section 2.6.11. 

3.5.2 Attempts to evade the 8 p _ q = \ condition 

suggested us that on real solutions 5 Ptq 



The shape of 5 P:q on figures 
to evade the uj = 1 case. 

F, 



13 



and 



14 



So we tried 



We plotted In In j£ for different values of uj, at a point satisfying constraints 



21 and 22 for the best choice of the /3 t 's that we found complying with constrain 

f 2 _ 



incirBI the expe< 



20 



xpectec 



value is —00 iff ^| = 1. We set the ratio c = 0.1 (to be compared with 2.838, where uj = 1 works). 



Only uj = 1 seems to make ^ = 1, cf. figure 

11 1 

3.6 Confrontation with Reality 



12 



We are going to do the same kinds of observations through SATLab as in section 2.7 in order to figure 
out why the Second Moment Method still fails to give high upper bounds in the distributional model. 
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Figure 12: Setting S p . q = 1+ J P -<, makes the Second Moment Method work iff oj = 1. 



3.6.1 Non-Independence of Values 



We focus our attention on variables with T occurrences among which U are positive. We denote by d 
the average proportion of those variables assigned 1 by a solution and u the average proportion of those 
variables assigned and 1 by a couple of distinct solutions. In a given sample of random solutions, we 
took all couples of different solutions and computed the following three quantities: d, d_d = d (1 — d) 
and u. At independence we should have u — d_d, which happens for c = 2 (cf. figure 13 1 but not for 
c = 4 (cf . figure [l4 1 . 

Moreover we can see that d is almost linear in U when c = 2 but it curves when c = 4. Note also 
that the range of U may be strictly included in [0 . . . T] (cf. figure 14 1. Determining the shape of d 



might help do better calculations of first and second moments, even though in section [5.5.2| we took 
S<n n = -i i \- a but it could not make the Second Moment Method work. It is clear however that the 



condition 5 p _ q — | we encountered in section 3.5.1 does not hold on real solutions. 



3.6.2 Non-Independence of Surfaces 



We perform the same experiment as in section 2.7.3| b ut we restrict surfaces to variables having T 
occurrences among which U are positive. On figure 1 15] we can see that there is still no independence 
of surfaces, although the restriction of the surfaces to these variables curbs the non-independence. 



4 Conclusion on the Second Moment Method 

Contrary to the First Moment Method, which works more or less finely, the Second Moment Method 
will not always work. Furthermore, it is rather difficult to make it work, and we were able to make it 
work only under very artificial conditions with respect to reality. Moreover, even when it works, we 
have not been able to find strong lower bounds with it. We got stuck at 2.83 for 3 different models 
(standard drawing model, implicants and standard distributional model). 

However we did not prove that it is impossible to find better lower bounds with our general 
framework, this is just numerical experiments. Moreover our framework may not be perfect, perhaps 
the parameters we consider are not relevant for the Second Moment Method, so there is still hope in 
making the Second Moment Method work and give higher lower bounds on the threshold of 3-SAT. 
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Figure 13: S p . q and /z Pj(J = 8 p>q (1 — S Ptq ) at c = 2. 
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Figure 14: <5 Pi9 and \jl va ^ 5 p ^ q (1 — S p ^ q ) at c = 4. 
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Figure 15: The distributional model slightly curbs the non-independence of surfaces. 
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